
    Structured Dictionary Learning for Energy Disaggregation

    Growing awareness of the environmental impact of energy consumption has increased the focus on reducing energy demand. Feedback on appliance-level energy consumption can help consumers reduce their demand. Energy disaggregation techniques recover appliance-level consumption from the aggregated consumption of a house. Because these techniques extract the consumption pattern of an individual appliance as features, they face the challenge of distinguishing two appliances with similar consumption. To address this challenge, we develop methods that leverage the fact that some devices tend to operate concurrently in specific operating modes. The aggregated consumption patterns of a subgroup of devices allow us to identify the concurrent operating modes of the devices in that subgroup. We therefore design hierarchical methods that replace overall energy disaggregation across all devices with a recursive disaggregation task over device subgroups. Experiments on two real-world datasets show that our methods improve on baselines. One of our approaches, the Greedy-based Device Decomposition Method (GDDM), achieves up to 23.8%, 10%, and 59.3% improvement in micro-averaged F-score, macro-averaged F-score, and Normalized Disaggregation Error (NDE), respectively.
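    The abstract describes the general idea of dictionary-based disaggregation without its details. As an illustrative sketch only (not the paper's GDDM or its hierarchical subgrouping scheme), the basic building block can be shown as follows: given hypothetical per-appliance dictionaries of consumption patterns, the aggregate signal is decoded by nonnegative least squares and sliced back into per-appliance estimates. All appliance names and signature values here are made up for the example.

```python
import numpy as np

def disaggregate(aggregate, dictionaries, n_iter=1000, lr=0.05):
    """Split an aggregate power signal across per-appliance dictionaries.

    aggregate    : (T,) total consumption over T time steps
    dictionaries : list of (T, k_i) nonnegative atom matrices, one per appliance
    Returns per-appliance signal estimates via nonnegative projected gradient.
    """
    B = np.hstack(dictionaries)               # (T, sum k_i) combined dictionary
    a = np.zeros(B.shape[1])                  # activation weights, one per atom
    for _ in range(n_iter):
        grad = B.T @ (B @ a - aggregate)      # least-squares gradient
        a = np.maximum(0.0, a - lr * grad)    # gradient step + nonnegativity
    # Slice activations back into per-appliance reconstructions
    parts, start = [], 0
    for D in dictionaries:
        k = D.shape[1]
        parts.append(D @ a[start:start + k])
        start += k
    return parts

# Toy example: two single-atom dictionaries with disjoint "on" periods.
fridge = np.array([[1.], [1.], [1.], [0.], [0.], [0.]])
heater = np.array([[0.], [0.], [0.], [2.], [2.], [2.]])
aggregate = np.array([3., 3., 3., 2., 2., 2.])  # 3x fridge atom + 1x heater atom
parts = disaggregate(aggregate, [fridge, heater])
```

    In a realistic setting the dictionaries would themselves be learned from per-appliance training signals, and the ill-posedness the abstract mentions (similar appliances sharing similar atoms) is exactly why the paper groups devices hierarchically instead of decoding all atoms at once.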

    Predicting Academic Performance: A Systematic Literature Review

    The ability to predict student performance in a course or program creates opportunities to improve educational outcomes. With effective performance prediction approaches, instructors can allocate resources and instruction more accurately. Research in this area seeks to identify features that can be used to make predictions, to identify algorithms that can improve predictions, and to quantify aspects of student performance. Moreover, research in predicting student performance seeks to determine interrelated features and to identify the underlying reasons why certain features work better than others. This working group report presents a systematic literature review of work in the area of predicting student performance. Our analysis shows a clearly increasing amount of research in this area, as well as an increasing variety of techniques used. At the same time, the review uncovered a number of issues with research quality that drive a need for the community to provide more detailed reporting of methods and results and to increase efforts to validate and replicate work.

    Genome sequencing reveals Zika virus diversity and spread in the Americas

    Although the recent Zika virus (ZIKV) epidemic in the Americas and its link to birth defects have attracted a great deal of attention, much remains unknown about ZIKV disease epidemiology and ZIKV evolution, in part owing to a lack of genomic data. Here we address this gap in knowledge by using multiple sequencing approaches to generate 110 ZIKV genomes from clinical and mosquito samples from 10 countries and territories, greatly expanding the observed viral genetic diversity from this outbreak. We analysed the timing and patterns of introductions into distinct geographic regions; our phylogenetic evidence suggests rapid expansion of the outbreak in Brazil and multiple introductions of outbreak strains into Puerto Rico, Honduras, Colombia, other Caribbean islands, and the continental United States. We find that ZIKV circulated undetected in multiple regions for many months before the first locally transmitted cases were confirmed, highlighting the importance of surveillance of viral infections. We identify mutations with possible functional implications for ZIKV biology and pathogenesis, as well as those that might be relevant to the effectiveness of diagnostic tests.

    Probabilistic Methods for Data-Driven Social Good

    Computational techniques have much to offer in addressing questions of societal significance. Many such questions can be framed as prediction problems and approached with data-driven methods. In addition to prediction, understanding human behavior is a distinguishing goal in societally relevant domains. In this work, I describe societally significant problems that can be solved with a collective probabilistic approach.

    These problems pose many challenges to techniques that assume data independence, homogeneity, and scale. In settings of societal importance, dependencies can define the data in question, from complex relationships between people to continuity between consecutive events. Rather than being generated by single, uniform sources, data in these domains can be derived from and described by heterogeneous sources. Finally, though many data-driven methods depend on large numbers of observations and high-quality labels to guarantee quality results, in domains of critical social value it is often infeasible to gather such quantities. These challenges demand methods that can utilize data dependencies, incorporate diverse forms of information, and reason over small numbers of instances with potentially ambiguous labels.

    There are also many opportunities in these domains. Models concerned with societally relevant problems can draw on the knowledge established by existing academic disciplines, from the social to the natural sciences. Such knowledge can inform each step of research, from choosing an appropriate problem to putting results into perspective. Furthermore, there are opportunities to obtain new insights into human behavior from the abundance of data generated by virtual and online activity and by mobile and sensor networks. The scale of this data necessitates computational methods; methods that can leverage prior knowledge and remain efficient even on large datasets have much to offer in these domains.

    In my work, I utilize a collective probabilistic approach for data-driven social good. This approach can capitalize on structure between data instances rather than flattening it. Furthermore, it can readily incorporate domain knowledge, which, especially when combined with a collective approach, is instrumental in learning from small datasets. When datasets are large, this approach leverages a class of probabilistic graphical models that offers efficient inference. Finally, this approach can be extended to model unobserved phenomena with latent-variable representations.

    I demonstrate the benefits of this approach in three societally relevant domains: sustainability, education, and malicious behavior. While these domains are diverse, the problems they present share several commonalities that are critical in data-driven modeling. For example, modeling data structure, from spatial relationships to social interactions, can reduce issues of sparsity and noise. Domain knowledge can also combat these issues, in addition to improving model interpretability. I show the benefits of domain knowledge in discovering sustainable products, predicting course performance, and detecting cyberbullying. In the domains of sustainability and malicious behavior, I demonstrate how to utilize spatio-temporal structure in the seemingly distinct tasks of disaggregating appliances and predicting the movements of human traffickers. In education and malicious behavior, I show how unobserved social structure is instrumental not only in modeling learning and aggression, but also in interpreting these dynamics in groups. In all three domains I show how to model, represent, and interpret latent structure. Thus, while making contributions to each problem setting and domain, I also contribute to the broader goal of data-driven modeling for social good.

    Same data, different conclusions: Radical dispersion in empirical results when independent analysts operationalize and test the same hypothesis

    In this crowdsourced initiative, independent analysts used the same dataset to test two hypotheses regarding the effects of scientists' gender and professional status on verbosity during group meetings. Not only the analytic approach but also the operationalizations of key variables were left unconstrained and up to individual analysts. For instance, analysts could choose to operationalize status as job title, institutional ranking, citation counts, or some combination. To maximize transparency regarding the process by which analytic choices are made, the analysts used a platform we developed, called DataExplained, to justify both preferred and rejected analytic paths in real time. Analyses lacking sufficient detail or reproducible code, or containing statistical errors, were excluded, resulting in 29 analyses in the final sample. Researchers reported radically different analyses and dispersed empirical outcomes, in a number of cases obtaining significant effects in opposite directions for the same research question. A Boba multiverse analysis demonstrates that decisions about how to operationalize variables explain variability in outcomes above and beyond statistical choices (e.g., covariates). Subjective researcher decisions play a critical role in driving the reported empirical results, underscoring the need for open data, systematic robustness checks, and transparency regarding both analytic paths taken and not taken. Implications are discussed for organizations and leaders whose decision making relies in part on scientific findings, consulting reports, and internal analyses by data scientists.
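    The abstract's central point, that the choice of operationalization alone can flip a conclusion, can be sketched with a toy example. The data below are entirely synthetic and chosen only to illustrate the mechanism; they are not from the study. Two hypothetical operationalizations of "status" (job-title seniority vs. citation count) applied to the same records yield effect estimates with opposite signs.

```python
# Synthetic records, purely illustrative: three speakers with a job-title
# rank (1 = most senior), a citation count, and words spoken in a meeting.
records = [
    {"rank": 1, "citations": 100, "words": 120},
    {"rank": 2, "citations": 900, "words": 150},
    {"rank": 3, "citations": 400, "words": 180},
]

def correlation_sign(xs, ys):
    """Sign of the sample covariance between xs and ys: +1, -1, or 0."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return (cov > 0) - (cov < 0)

words = [r["words"] for r in records]

# Operationalization 1: status = job-title seniority (negate rank so
# larger values mean higher status).
by_rank = correlation_sign([-r["rank"] for r in records], words)

# Operationalization 2: status = citation count.
by_cites = correlation_sign([r["citations"] for r in records], words)
```

    Here `by_rank` and `by_cites` disagree in sign, so an analyst who equates status with seniority would conclude that higher status predicts less talking, while one who equates it with citations would conclude the opposite, from the same table. Multiverse tools such as Boba make this dependence explicit by enumerating the analysis paths rather than reporting a single one.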